PCT 



|LD INTELLECTLAL PROPERTY ORGAMZATIol 

Iniernitioni! Bureau 




INTERNATIONAL APPLICATION PUBLISHED UNDER THE PATENT COOPERATION TREATY (PCT) 



(51) Intrrnational Patent Classification 

C07H 15/12, C12P 21/00 
C12N 15/00, I/OO 



Al 



(11) International Publication Number: WO 89/ 09219 

(43) International Publication Date: 5 October 1989 (05.10.89) 



(21) International Application Number: PCT/US89/O0795 j (81) Designated States: AU. 

(22) International Filing Date: 27 February 1989 (27.02.89) 



(31) Priority Application Number: 172,467 

(32) Priority Date; 24 March 1988 (24.03.88) 

(33) Priority Country: US 



(71) Applicant: THE GENERAL HOSPITAL CORPORA- 

TION [US/US]; Fruit Street. Boston, MA 02 1 14 (US). 

(72) Inventors: RICHARDS, Fric • 2n Dirhv Streir \VV<;t 

Newton, MA 02161 ^US). AUSUBEL, F'redericV, M. ; 
271 Lake Avenue, Newton, MA 02161 (US). 

(74) Agents: GOLDSTEIN, Jorgt. A et si.; Saidman, 
Stenne, Kessler & Goldstein i225 Connecticut Ave- 
nue, N.W., Suite 300, Washington, DC 20036 (US). 



Published 

With international search report. 



(54)Tit!e: ARTIFICIAL CHRO.MOSOME VECTOR 



(57) Abstract 



The present invention relates to a recombinant DNA rr.olecule which contains the tc'.c^r.ore of a higher eukar>'0'e 
particularly a plant, the telomere itself, a method of produciiij; a polypeptide in a recipient ceil wh, J, utilizes said recombi- 
nant DNA molecule, host cells transformed with said recomhi.iant molecule, and ures !"or said recombinant molecule 



FOR THE PURPOSES OF INFORMATION ONLY 



Codes used to identify States party to the PCT on the front pages of pamphlets publishing international appli- 
cations under the PCT. 



AT 


Au5th« 


FR 


Frince 


ML 


Mall 


AU 


AustnUii 


GA 


G«bon 


MR 


MaunUnia 


BB 


Btrbtdoi 


GB 


United Kjrgdom 


MW 


Malawi 


Be 




HU 


Hungiry 


NL 


Nelherltnds 


BG 




rr 


luly 


NO 


Norwty 


BJ 




JP 




RO 


Romania 




i 










CM 




i -t 








DE 


Geim»ny, Fcdfril ^Jcp'jMic of 


LV 


Luxembourg 






OK 


Denmirt 


MC 


Mon«co 


L'S 




n 


Flolind 


MG 


Midmscir 







■I Ani: 



wo 89/09219 





PCT/US89/00795 



- 1 - 



TITLE OF THE INVENTION 



ARTIFICIAL CHROHOSCHE VECTOR 



Field ov the Invention 



This invention relates to the field of plant recombinant 
aeneti cs = 



The development of -rGcombinant DNA techniques has opened up the 
possibility of making specific changes in an organism's genetic 
makeup. Alteration of genetic endowment can lead to the development 
of useful strains of microorganisms, and more productive varieties of 
domesticated plants and animals. For example, insect resistant plants 
have recently been produced by the addition of a bacterial gene which 
instructs th2 recipient plants cells to produce a protein toxi: to 
certain types of insects (see Vaeck, et al . . Nature 328:33 ( 1987)). 
Manipulation of plant species through genetic engineering will become 
an important cofnplement to classical breeding techniques in the 
development of plant varieties with new traits, such as improved 
nutritional quality, productivity, disease resistance, and drought and 



the introduction cf these genes into the recipient organism in a form 
that allows the introduced genetic information to be read (i.e., 



Ba ckground of the Inventio n 
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expressed) and transmitted to successive generations. The basic 
techniques of molecular cloning are well established, but the neces- 
sary tools for accomplishing the efficient introduction and exprassion 
of new genetic information in higher eukaryotic organisms are still 
limited. 

Two general approaches are used to introduce new genetic inforina- 
tion into cells, a procedure commonly referred co as "genetic 
transformation." One approach is to introduce the new genetic infor- 
mation as part of another DNA molecule, referred to as a "vector," 
which can be maintained as an independent unit (i.e., an episome) 
apart from tiie chromosomal DNA molecule(s). Episomal vectors contain 
all the necessary DNA sftiiufince elements required for DNA replication 
and maintenance of thd vector within the cell. Many episomal vectors 
are available for use in bacterial cells (for example, sec Maniatis, 
T., et al ■ , Molecu lar C l oning: A Laboratory Manual (Cold Spring 
Harbor, 1982)). However, only a few episomal vectors that function in 
higher eukaryotic cells have been developed. The available higher 
eukaryotic eoisomal vectors are based on naturally occurring viruses 
and most function only in mammalian cells. In higher plant systems 
there are no known double-stranded DNA viruses that replicate through 
a double-stranded intermediate upon which an episomal vector can be 
based. Although an episomal plant vector based on the Cauliflower 
Mosaic Virus has been developed, its capacity to carry new genetic 
information is limited (Brisson et al . . Nature 310:511 (1984)). 

The other general r^ethod of genetic transformation involvoo 
integration of the introduced DNA sequences into the recipient call's 
chromosomes which permits the new information to be replicated and 
partitioned to the cell's progeny as a part of the natural chromo- 
somes. The most common ^'orm of integrative transformation is called 
"transfecti on" and is frequently used in mammalian cell culture 



usual 1^ t}ror;eri ano joini;i; tocjetiiei" ir, vaciou:. ccr,b i [i at i on s roiOfLi 
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it is integrated at random sites into the cell's chromosome (see, for 
example Wigler, at al. . Cell 11:223 (1977)). A common problem with 
this procedure is the rearrangement of introduced uNA sequences (see 
Shingo, K., et al . . Hoi. Cell. Bio l. 6:1787 (1986)). A more refined 
form of integrative transformation can be achieved by exploiting 
naturally occurring viruses that integrate into the host's chromosomes 
as part of their life cycle (e.g., retroviruses) (see Cepko, C, et 
al .. Cell 37:1053 (1984)) . 

The iTiOSt common genetic transformation method used in higher 
plants is based on the transfer of bacterial DNA into plant chromo- 
somes that occurs during infection by the phytopathcgeri ic soil bac-, 
terium Agrobacterium (see Ne.<;tif e t al . . Ann. Rev. Plant Phys. 35:387- 
413 (1984)). By substituting genes of interest for the naturally 
transferred bacterial sequences (called T-DNA), investigators have 
been able to introduce new DNA into plant cells. However, even this 
more "refined" integrative transformation system is limited in three 
major ways. First, DNA sequences introduced into plant cells using 
the Aqrobactprin m/T-nMA c.-tsm are frequently rearranged (see Jones, 
ii-lL., Mol. Gen. Ganot. 207:478 (1987)). Second, the expression of 
the introduced DNA sequences varies between individual transformants 
(see Jones et al . . EMBO J. 4:2411-2418 (1985)). This variability is 
presumably caused by rearranged sequences and the influence of 
surrounding sequences in the olant chromosome (i.e., position 
effects). A third drawback of the Ag.-obacteri um/ T-DNA system is the 
relian:.?. on a "gene addition" mGchanism: the new genetic inforn.ation 
is added to the genome (i.e., all the genetic information a cell 
possesses) but does not replace information already present in the 
genome. While gene addition is suitable for many applications, the 
ability to actually replace a specific gene with an. altered copy via 
homologous recombination (i.e., recombination between DNA of the same 
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formed cells underwent a gene replacement event in Thomas et a1 . . Cell 
44:419 (1985); see also Smithies, et al . . Nature 317:230 (1985]). 

The present invention discloses linear episomal transformation 
vectors, based on natural chromosomes, that can replicate and be 
stably maintained in higher plant cells. These artificial plant 
chromosome vectors will provide a versatile tool for qenetic transfor- 
mation of plant species and solve many of the problems associated w.th 
present DNA transformation technology. In addition, development of 
artificial plant chromosome vectors will facilitate the construction 
of artificial chromosomes that can function in other higher eukaryotic 
cells. 

Artificial chron^oicrries are man-made linear DNA molecules con- 
structed fror.. essential cis.-acting DNA sequence elements that are 
responsible for the proper replication and partitioning of natural 
chromosomes (see Hurray et al . . Nature 301:139-193 (1983)}. These 
essential elements are: (1) Autonomous Replication Sequences I'ARS) 
(have properties of replication origins, which are the sites for 
initiation of DNA replication), (2) Centromeres (site of kinetochore 
assembly and responsible for proper distribution of replicated chro- 
mosomes at mitosis and meiosis), and (3) Telomeres (specialized struc- 
tures at the ends of linear chromosomes that function to stabilize the 
ends and facilitate the complete replication of the extreme termini of 
the DNA molecule). 

At present, these essential chromosomal elements have been iso- 
lated only from lower eukaryotic species. ARSs have been isolated 
from the unicellular fungi Saccharomvces cerevi siae (brewer's yeast) 
Schizosaccharomvces pombe (see Stinchcomb et al. . Nature 282:39- 
43 (1979) and Hsiao gfc al . . J. Proc. Natl . Acad. Sci . USA 76:3829-3833 
(1979)). ARSs behave like replication origins allowing DNA molecules 
that contain the ARS to be replicated as an episome after introdur.tion 
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Functional centromaric (CEN) sequences have been purified from S. 
cerevisiae (see Clark et a1 . . Nature 267:504-509 (1980) and Stinchcomb 
et a1. . J. Holec. Biol. 158 :157-179 (1982)). Episomes carrying the 
yeast CEN sequences display proper segregation into daughter yeast 
cells during mitosis and meiosis, in contrast to ARS plasmids lacking 
a centromere. 

Telomeres, the last chromosomal element in lower eukaryotes to be 
cloned, are thought to be involved in the priming of DMA replication 
at the chromosome end (see, for example, Blackburn et al . . Ann.' Rev. 
Biochem. ^:163-194 (1984)). This is because conventional QNA poly- 
merasss are template dependent, syrithesizc ONA in the 5' to 2' direc- 
tion, and require an ol igoi'uc itotide pnmer to donate a 3' OH grouD. 
When this primer is removed, unrepl icated single-stranded gaps arise; 
most of these gaps can be filled in by priming from 3' OH groups 
donated by newly replicated strands located at the 5' end of tlia gap. 
However, the unreplicatad gaps which lie next to the extreme 5' end of 
the DNA duplex cannot be primed in this mann&r. Consequently, telo- 
meres must provide an alternative priming mechanism. 

Telomeres are also responsible for the stability of- chromosomal 
termini. Telomeres act as "caps," suppressing the recomb^nogenic 
properties of free, unmodified DNA ends (see Blackburn, supra ) . This 
reduces the formation of damaged and rearranged chromosomes which 
arise as a consequence of recombination-mediated chromosome fusion 
events . 

Telomeres may also contribute to the establishment or maintenance 
of intranuclear chromatin organization through their association with 
the nuclear envelope (sec-, for example, Fussell, CP., Genetica 
62:192-201 (1984)). 

Telomeric or tel omeric-1 i ke DNA sequences have been cloned from 
several lower eukaryotic organisms, principally protozoans and yeast. 
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(1983)). A telomere from the flagellate Trvpanosonia has been cloned 
(see, for example, Blackb-jrn et al . . Cell 38:447-457 (1984). A yeast 
telcmeric sequence has been identified (see, for example, Shampay et 
iL., Nature 310 :154-157 (1984)). 

None of the essential components, including the telomeres, how- 
ever, function in higher eukaryotic systems. For example, there have 
been numerous attempts to isolate ARSs from other eukaryot ^s by 
selecting for pieces of DNA that will serve as an ARS in yeast. While 
such DNA fragments can be readily identified, they do not promote 
evtrachromosomal replication in cells from the donor organism. 

DNA molecules ca'^rying ARSs that function in yeast cells do not 
promote extrachromosomal replication of these molecules in mouse cells 
(see Roth et al., Mol . Cell. Biol. 3:1898-1908 ( 1983)). An ARS 
sequpnced from cultivated tomcto, which operates in yeast, fails to 
function in tcmatn cells (see Zabel, P., ''Toward the Construction of 
Artificial Chromosomes for Tomato," In: M olecular Form end Functions 
P-I-iL^^ Plant Genome, (Plenum Press ( 1985) and Jongsma et al . . Plant 
Molec.,_BioK 8:383-394 (1987)). Similarly, yeast CEN sequences do not 
function when introduced into mouse (Roth et al . Mol . Cel 1 . Biol . 3: 
1378 (1983)) or Aspergillus chromosomes (Boylan, et a l . . Mol. Cell. 
Biol. 5:3621 (1985)). In addition, telomeres from the protozoan 
Letrahymena do not function in cells of the vertebrate Xenopus (Yu, et 
aK, Gene 56:313 (1987)). Finally, although researchers were able to 
show th?t a S. pombe chromosome can replicate at a reduc--d efficiency 
m mo^se cells, the centromeres of chis lower eukaryote apparently do 
not function in t[,e higher eukaryote nucleus (see Allshire et a1 . . 
Cell 50:391-403 (1987)) . 

Artificial chromosomes have been constructed in yeast using the 
three cloned essential chromosomal elements. Murray et al . . Nature 
305:189- 193 ( 1983), disclose a clonino systeoi based on the jn yj.trn 



irtificial y^azl c;;rc~GSC"os contain clcn^o g-mc., replicators, 
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centromeres and telomeres but have an impaired centromeric function in 
short (less than 20kb) artificial chromosomes. 

The ability to construct artificial chromosomes that functic^ in 
yeast, however, does not teach one skilled in the art how to construct 
an artificial chromosome using the essential elements which will 
f;inction in higher eukaryotes. The cells of higher eukaryotic 
organisms diff-r from yeast cells in ways that place many further 
demands on the functioning of their genes. The amounts of DNA in 
higher eukaryotes is large, varying between species in ways that are 
not yet completely understood. They have more DNA and their genomes 
are composed of different cl35<;es of 

l-iw.www. w»i-»<* v^*-.in- it^yiuiio 

interrupted by numerous and iengthy introns, structural sequences 
necessary for chron^oscm- sorting during cell division, and various 
kinds of repetitive sequences. 

Lower eukaryotic systems were the only source of the artificial 
chromosome assential elements until the teaching of the present inven- 
tion. It is clear that the construction of an efficient, functional 
artificial plant chromosome requires the isolation of the threa 
essential elements from pTant chromosomes, and this novel methodology 
and the elements derived therefrom sre taught by the present inven- 



tion. 



Summary of t h e Invention 

The present invention provides for the constructicn of a 
recombinant molecule which contains the telomere of a higher eukary- 
ote . 

The invention also provides for a method of producing a poly- 
peptide which comprises cloning a gene sequence capable of producing 
the polypeptide into a recombinant molecule which contains a telomere 
of a higher eukaryotp, introdncina the rprn-hip;5nt r^,.T^^.,T~^ ^ . _ • „ ; 
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This invention also discloses a novel method of isolating and 
enriching the telomeric clones of higher eukaryotic organisms by using 
genetic engineering methods in a suitable manner Jeccribed herein- 
after. 

The invention provides an alternate method for the tra.isformation 
of plant cells by constructing functional artificial chromosome 
vectors. 

Artificial chromosomes that function in higner eukaryotes have 
never been constructed. The present invention discloses for the first 

time the molecular cloning of telomeres from A.^ th ai i an; , a small 

flowering plant in the mustard family. Before now, no essential 
chromosomal element had !jf^2n isolated from a higher eukaryote. The 
cloned A. thal jana Telomere serves as a starting point for the con- 
struction of a functional artificial plant chrcrnosc^rG . 

Use of the artificial chromosome vector of the present invention 
solves i.iany of the problems associated with the current transformation 
technology used to introduce new DNA into higher eukaryotic ce'Us. 
Since artificial chromosomes are maintained in the cell nucleus as 
independently replicating DNA molecules, sequences introduced on such 
vectors are not subject to variable expression due to '"ntagration 
position effects. 

The problem of DNA rearrangement which occurs when sequences are 
introduced by transfection protocols or using the Agrobacteri um/ T-DNA 
system can be avoided with the present invention. Integrative 
transforination systems general ly involve introduction of DN'A in 
recombinogenic form (e.g. . DNA with free ends) and selection for 
transformants that splice together the incoming sequences and the host 
chromosome (see Orr-Weaver et a i . . Proc. Natl . Acad. Sci . USA 78:5354- 
6358 (1981)), Since an artificial chromosome can be delivered to the 
plant nucleus as an intact, unbroken, double-stranded DNA molecule 
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Artificial chromosome vectors can also be used to perform 
efficient gene replacement experiments. At present, gene replacement 
has not been demonstrated in a plant system and has only been detected 
at low frequency in mammalian tissue culture systems (see Thomas et 
al ■ . Cell 44:419-428 (1^86) and Smithies et al . . Nature 317:230-234 
(1985)). Ti.e reason for this is the high frequency of illegitimate 
nonhomologous recombination events relative to the frequency of 
homologous recombination events (the latter are responsible for gene 
replacement). Artificial chromosomes participate in homologous recom- 
bination preferentially. Since the artificial chromosomss remain 
intact upon delivery, no recombinogenic broken ends will be generated 
to serve as substrates for the extremely efficient illegitimats recom- 
bination machinery. Thus, the artificial chro'ros' '-.a vectors disclosed 
by the present invention will be stably maintained in the nucleus 
through meiosis and available to participate in homology-dependent 
meiotic recombination. 

In addition, because in principle, artificial chromosomes of iny 
length could be constructed using the teaching of the present inven- 
tion, the VGCtcrs could be used to introduce extremely long stretches 
of foreign DNA into cells. 

Another new and u.seful feature of the present artificial chromo- 
some transformation system is the absence of host range limitations; 
foreign DNA can be delivered to plant cells directly without the use 
of Aqrobacterium . 



Figure 1 shows sche:nati cally the procedure used for constructing 
the primary telomeric libraries which produce random, non-tel omeric 
clone: ^nd telomeric clones, and the procedure used for enriching for 

che telomeric clones produced by the two independent primary 



Description of the Figures 
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Figure 2 shows a Southern blot representation of the hybridiza- 
tion of the pAtT4 insert to Bal31-treated A. thaliana nuclear DNA. 

Figure 2(A) shows A. thaliana nuclear DNA treated with 8<il31 for 
0 (lane 1), 5 (lane 2), 15 (lane 3), 30 (lane 4), or 50 minutes (lane 
5), and subsequently digested with Mbol (left lanes) or BamHI (right 
lanes). The DNA was sized-fractionated by electrophoresis through a 
0.8% agarose gel and transferred to a nylon membrane. The membrane 
was probed with radiolabeled pAtT4 and washed at high stringency. 

Figure 2(B) shows the membrane shown in (A), rehybridized with 
radiolabeled pARR16, an A. thaliana rDNA clone, and washed at high 
stringency . 

Figure 3(A) depicts ':he chromosome configuration in a talotri- 
somic mutant. Th^ extra telocentric chromosome has two telomeres: 
one identical to that present on the full length chromosomes, and a 
new telnmara added adjacent to the centromere. A hypothetical 
rsstricticn map shown under the linear chromosomal DNAs illustrates 
the expected correlation between the telotrisomic genotype and the 
presence of a novel telomeric restriction fragmont . 

Figure 3(B) shows a Southern blot representation of three disomic 
and three telotrisomic plants from the progeny of a TrSA telotrisoi-nic 
parent probed with radiolabeled pAtT4 and washed at high stringency. 
A novel, polymorphic 15kb Oral band, present in IrSA DNA but absent 
from wildtype disomic (wt) DNA, is marked with an arrowhead. 

Figure 4 shows the DNA sequence of the pAtT4 insert. Insert 
sequences are shown in larger font and flanking vector sequGncos 
displayed in smaller font. The fusion of the blunt cloning site of 
the vertor to the telomeric sequences occurs Aflur the und2rlincd 
sequence denoted 1/2 Hindi. The Hbol sitri of the insert, which is 
joined to the BamHI site of the vector, is underlined and labeled. 
The prevalent telomeric repeat 5' -[CCCTAAA] -3' is displayed in hep- 
tameric blocks. Ths two Vdrlant rco.-at;, 5' -^rjCT'-.^l --■ -^r-. ^.-r^n.- 
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Figure 5 shows a Southern blot representation of the pAtT4 insert 



-Figure 5(A) shows Z. mays genomic ONA treated with Bal31 [0.5 
U/ml] for 0 (lane 1), 15 (lane 2), 35 (lane 3), 60 (lane 4), or 90 
minutes (lane 5), and subsequently digested with Bell. The DNA was 
sized-fract ionated by electrophoresis through a 0.7% agarose gel and 
transferred to a nylon membrane. The membrane was then probed with 
radiolabeled pAtT4 and washed at high stringency. 

Finure 5(B) shows Z. mays genomic DNA treated with Bal31 [0.25 
U/ml] for 0 (lane 1), 5 (lane 2), 15 (lane 3), 30 (lane 4), and 50 
minutes (lane 5), and subsequently digested with Taql. The DNA was 
size-fractionateu by eletlrQpl'oresis through a u.67» agarose gel and 
transferred to a nylon rr.eT.brane. The membrane was then probed with 
radiolabeled pAtT4 and washed at high stringsncy. 

Figure 6 shows a Southern blot representation of the pAtT4 
hybridization to Ba!21-trsa!;ed liu!,ian DMA. 

Figurs 5(A) shows human genomic DNA [1/ig/ml] treated with Bal31 
[1 U/ml] fryr 0 (lane 1), 5 (lane 2), 15 (lane 3), 30 (lane 4), and 60 
minutes (lane 5), and subsequently digested with Hindlll. The DNA was 
sized-fractionated by electrophoresis through a 0.7% agarose gel and 
transferred to a nylon membrane. The membrane was then probed with 
radiolabeled pAtT4 and washed at low stringency. 

Figure 6(B) shows the membrane in 5(A) rehybridized with a radio- 
labeled XenoDUs rDNA clone (X1.4DNA), and washed at low stringency. 

Figure 7 schematically shows a procedure for making minicb^omo- 
somes . 

Figure 8 schematically shows a shotgun cloning protocol for the 
functional selection of a plant AR5 sequence. 

Figure 9 schematically shows a cloning protocol for cloning plant 
centromeres using telomeric probes and telotrisomic mutants. 



hvhrlfMratinr. tn P-TJ 1 -*-rs = t=.-! 7 -:-..c r.fJ*. 
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Figure 10 shows schemes for constructing plant artificial 
chromosomes in the form of E. col i/ p1 ant circular shuttle vectors (A); 
yeast/plant circular shuttle vectors (B); and ye<i:>t/pl a;it "Mnoar 
shuttl e vectors (C) . 



I. DEFINITIONS AND BACKGROUND 

In the following description, reference will be oiade to various 
methodologies well known to tho^.e of skill in the art of recombinant 
genetics. Publications nrJ other materials setting forth such well 
known methodologies will be referred to in th« course of this descrip- 
tion, and ?re incorporated herein by reference. 

The present invention relates to the construction of artificial 
chrcrriosome vectors for the genetic transformation of plant cells, 
processes for their preparation, uses of the vectors, and systems 
transformed by them. Standard reference works settir.g forth the 
general principles of recombinant DNA technology include Lewin, B.H., 
Genes II . John Wiley & Sons, Publishers-, New York ( 1985). Cthor works 
describe methods and products of genetic engineering; see, e.g., 
Maniatis et al . . Molecular Cloning. A Laboratory Manual (Cold Spring 
Harbor Laboratory, 1982); Watson et al . . Recombinant DNA: A Shnrt 
Course (W.H. Freeman and Co., 1983); Setlcw et al . . Genet i c Engin eer- 
ing: Pri n ciples and Methods (Plenum Press, 1979); and Dillon et a1 . , 
Recombinant DNA Methodology (John W-iley i Sons, 1985). 

By "transformation" or "transfection" is meant the acquisition in 
cells of new genetic markers through incorporation of added DNA. This 
is the process by which naked DNA is introduced into a cell, resulting 
in a heritable change. 



Description of the Preferred Embodiments 



A ^ 



. . J 

u i tr ' J 



refers to living oraanisms whose 
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eukaryotes differ fundamentally in the way their genetic information 
is organized, as well as their patterns of RNA and protein synthesis. 

By the term "lower eukaryote" is meant a euk;ryotf? characterized 
by a comparatively simple physiology and composition, and unicellu- 
larity. Examples of lower eukaryotes include flagellates, ciliates, 
and yeast, for example. 

By contrast, the term "higher eukaryote" means a multicellular 
eukaryote, characterized by its greater complex physiological machan- 
iSMs as well as its ability to interact with its environment in a more K 
sophisticated manner. Generally, more complex organisms such as | 
plants and animals are included in this category. Preferred higher g 
eukaryotes to be transformed by the present invention include, for 
example, monccot and dicot angiosperm species, gymnosperm species, 
fern species, plant tissue culture cells of these species, and algal 
cells. It will of course be understood that prokaryotes and eukary- 
otes alike may be trarisfarmad by tiie methods of this invention. 

As used herein, the term "plant" includes plant cells, plant 
protoplasts, plant calli, and plant cells or tissue culture from which 
whole plants can be regenerated. 

A variety of processes are known which may be utilized to effect 
transformation; i.e., the inserting of a heterologous gene into a host 
cell, whereby the host becomes capable of efficient expression of the 
inserted genes. By "gene" is meant a DMA sequence that contains 
information for construction of a polypeptide or protein, and includes 
5' an-^ 3' ends. 

As used herein, "heterol cgous gene" is a structural gene that is 
foreign, i.e., originating from a donor different from the host or a 
chemically synthesized gene, and can include a donor of a different 
species from the host. The heterologous gene codes for a polypeptide 
ordinarily not produced by the organism susceptible to transformation 
by the expression vehicle. Another type of "heterologous gene" is an 
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gene Meful in the present invention is a mutant gene which encodes a 
he. ide-resistant form of a normany occurring enzyme. 

By "host" is meant any organism that is the recipient of a 
replicable plasmid, or expression vector. Ideally, host strains used 
for cloning experiments should be free of any restriction enzyme 
activity that might degrade foreign DNA. For example, it is 
advantageous that a prsferred E. col i host strain be defective in 
homology-based recorr.bination ( recA ) since inadvertent recombinationai 
events may threaten the integrity of the cloned DNA. The nucleic acid 
of the host should also be easily separable from that of the cloning 
vector by physical means (e.g., differential centri fugation based on 
G-C content or encapsulaMcr.) cr differ in its reactivity to alkali or 
organic acids (ohenoi). Preferred examples of host cells for cloning, 
useful in Lhe present Invention, are bacteria such as E scherichia 
col 1 , Bacillus subtil is , Pseudomon as. Strentomyces . S almonel 1 a . and 
yeast such as c°revi si ae . 

Sy "GxprciSion" is meant the process oy which a structural .:ene 
produces a polypeptide. It involves transcription of the gene into 
messenger RNA (inRNA) and the translation of such mRNA into polypep- 
tide(3) . 

A method useful for th3 molecular cloning of DN'A sequences 
includes in vitro joining of DNA segments, fragmented from a source 
high molecular weight oligonucleotide, to vector DNA molecules capable 
of independent replication. The cloning vector may include plasmid 
DNA (see Cohen, et al . . Proc. Natl. Acad. Sci . USA 70:3240 ( 1973)), 
phage DNA (see Thomas, et al . , P roc. Nat1 . Acad. Sci. USA 71:4579 
(1974)), or SV40 DNA (see Nussbaum, et al . . Proc. Natl . Acad. Sci . USA 
73:1068 (1976)). 

The term "plasmid" or "cloning vector" as used herein refers to a 
closed covalently circular extrachromosomal DNA which is able to 
autonomously renli^^tq in :> host cell and which is riormally nones- 
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aJL:., U.S. patent 4,458,464, which discloses examples of DMA plasmids). 
In a preferred embodiment, the plasmid.s used in the present invention 
are identical except for the orientation of the polyl inker (see Levin- 
son, ?t a1. J. Mol. Add!. Genet. 1:507-517 (1984)). In a preferred 
embodiment, the plasmids used in the present invention also contain an 
M13 origin which allows single-stranded versions of the plasmids lo be 
generated. 

The single-stranded, filamentous bacteriophage H13 is a we 11 - 
known cloning vector (see, for example. Messing, et al . . Pr oc. Natl . 
Acad. Sci. USA 74:3642-3646 (1977); Van Wegenbeek et al. . Gene 11:129- 
148 (1980); end Gil lam et al., f^nne 12:129-137 (1930)). .^rcferabiy, 
the small plasmids pSDClZ ind pSDClS, with cloning polyl inker, may be 



By "polyl inker" it is meant a DNA molecule, generally 50 to 60 
nucleotides long and synthesized chemically. In a preferred embodi- 
ment, this fragment contains one, or preferably more than one, 
restriction enzyme site for a blunt-cutting enzyme and a staggered- 
cutting enzyme, preferably BamHI. One end of the polyl inker fragment 
is adapted tc be ligatable to or.e end of the linear molecule and the 
other end is adapted to be ligatable to the other end of the linear 



A fragment of DNA, from any source whatsoever, may be purified 
and inserted into the plasmid at any of the polyl inker region restric- 
tion endonuclease cleavage sites. DNA to be inserted into a niasmid 
may be from prokaryotes, such 35 Escherichia c oli. Bacillus suhtili^ . 
Streptomyces lividans or the like; from viruses which infect prokary- 
otes, such as lambda, T4, C31 or the like; from eukaryotes, such as 
Saccharomyces cprejQsiae, Mus musculus . Homo sapiens , or the like; or 
from viruses which inject eukaryotes such as Herpes Simplex, SV40, or 
the 1 ike. 



used. 
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this is accomplished by incubating the plasmid in the presence of a 
restriction endonuclease such that the restriction endonuclease 
cleaves the plasmid to produce a linefr molecule. 

Endonucleases preferentially break the internal phosphodiester 
bonds of polynucleotide chains. They may be relatively unspecific, 
cutting polynucleotide bonds regardless of the surrounding .lucleotide 
sequence. However, the endonucleases v/hich cleave only a specific 
nucleotide sequence are called restriction enzymes. Restriction 
endonucleases generally internally cleave foreign DNA molecules at 
specific recognition sites, making breaks wichin "rncognitiop" 
sequences that exhibit two-fold symmetry around a given point.. 
Because the name sequence (running in opposite directions) is found cn 
both str?nd'. , sua, enzyiiies always create double- stranded breaks. 

Host often, the enzymss make a staggereri cleavage, yielding DNA 
fragments with protruding single-stranded 5' termini. Such ends are 
said to be "sticky" or "cohesive" because th&y will hydrogsn bond to 
compl emantary 3' ends. As a result, the end of any DNA fragment 
produced by an enzyme, such as EcoRI, can anneal with any other frag- 
ment produced by that enzyme. This properly allows splicing of 
foreign genes into plasmids, for example. For purposes of the present 
invention, preferred restriction endonucleases include Hindi, Hbol, 
and BamHI. 

Some of these endonucleases create fragments that have blunt 
ends, that is, that lack any protruding single strands. When DNA has 
been cleaved with restriction enzymes that cut across both strands at 
the same position, blunt end ligation can also be used to join the 
fragments directly together. 

The great advantage of this technique is that any pair of ends 
may be joined together, irrespective of sequence. 

Those nucleases that preferentially break off terminal n'jcleo- 
tides 3re referred Lg cX0"uc1 eases . For example, small doletions c^n 
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with an exonuclease which starts from each 3' end of the DNA and chews 
away single strands in a 3' to 5' direction, creating a population of 
DNA molecules with single-stranded fragments at each enu, scr.s con- 
taining terminal nucleotides. The nuclease Bal31 has often been used 
because this enzyme chews away both strands from the ends oV linear 
DNA molecules. Exopucleases to be u:;ed in thG present invention 
preferably include Bal31, SI, or ExoIII. 

Once the source DNA sequences have been cleaved and the clasmid 
linearized, they are incubated together and enzymes capable of mediat- 
ing the ligation of the two DNA molecule:; are added. Preferred 
enzymes include T4 ligase, E. col i ligase, or other similar enzymes. 
The action of thise enzymes results in the sealing of the linear DNA 
to prodiice a closed co"alently circular DNA molecule containing the 
desired fragment (see, for example, U.S. Patents '^;237,224; 4,254,731; 
4,273,975; 4,322,499 and 4,336,336). These nucleolytic reactions can 
be C'jntrol'ied by varying the time of Incubation, the temperature, and 
the enzyme concentration needed to make deletions. 

It is to be understood that the termini of the linearized plasmid 
and the termini of the DNA fragment being inserted must be complemen- 
tary in order for the ligation reaction to be successful. Suitable 
complementarity can be achieved by choosing appropriate restriction 
endor.ucleases {i.e., if the fragment is produced by the same restric- 
tion endcnuclease as that used to linearize the plasmid, than the 
termini of both molecules will be complementary). As discussed pre- 
viously, in a preferred embodiment, at least .wo classes of the 
plasmids used in the present invention are adapted to receive the 
foreign ol igonuclectirie fragments in only cne orientation. 

After joining the DNA segment to the plasmid. the resulting 
hybrid DNA (known as a chimera) can then be selected from among the 
large population of clones or libraries. 

As used herein, a "library" is a pool of random DNA fragments 
which are cloned. In prirciple, any gene can be isolated by screening 
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Young et a1 . . In: Eukaryotic Genatic Systems. ICN-UCLA Symposia on 

Molecular and Cellular Biology. VI I (New York, Academic Press), pp. 
315-331 (1977)). Each gene library may contain the DNA of a given 
organism inserted as discrete restriction enzyme-generated fragments 
into many thousands of plasmid vectors. For purposes of the present 
invention, E. col i . yeast, and Salmonella plasmids are particularly 
useful when the genome inserts come from other organisms. 

The feasibility of constructing and screening libraries frcm the 
DNA of organisms with small genomes, such as Drosphil a or yeast, has 
been demonstrated (see, for example, Weinsink et al . . Cell 3:315-325 
(1974)) and Carbon et, al . . In: Recombinant Molecule,': Imoact on 
Science and Societ y ([(aveii Press), pp. 335-378 (1977)). Handling 
libraries fr2~i. lar-jer genomes, however, is difficult because of the 
numerous tachnical limitations inherent in manipulating such a ccmplcx 
system. 

By "hybridization" is meant the pairing of complementary RMA and 
DNA strands to produce an RNA-DNA hybrid, or the partial pairing of 
DNA single strands from genetical''y different sources. 

As used herein, a "probe" is any biochemical (usually tagged in 
some way for ease of identification), used to identify or isolate a 
gane, a gene produce, or a protein. 

A "selectable marker" is a gene with a known location on a chro- 
rr.osctne and a clear-cut phenotype, permitting the identification of the 
existence of a particular sequence in which the marker has been 
attached. Use of selectable markers is described, for example, in 
Broach et al . . Gene 8: 121-133 (1979). Examples of selectable -arkers 
include the thymidine kinase gene, the cellular adenine-phosphoribo- 
syltransferase gene and the dihydryl fol ate reductase gene, and prefer- 
ably hygromycin phosphotransferase genes and neomycin phosphotrans- 
ferase genes, among others. Preferred selectable markers in the 
present invention include genes whose exprpccion confer an antibiotic 
resistance to the host cell, sufficieni- to enable the maintanarcs of a 
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the plasmid into new host cells. Of particular interest in the pre- 
sent invention are proteins conferring cellular resistance to ampicii- 
lin, chloramphenicol, tetracycline, and G-418, for example. 

II. TELOMCRE 

Based upon nuclease cleavage and ligation methods, utilization of 
plasmids with polyl inkers oriented in opposite orientation, and the 
use of an H12 helper phage, the telomeres of higher eukaryCtes, 
preferably of a plant, pay be isolated. 

« j-.^-i^iiw^ tmi^uv^ ii.K.M I. , U1I1UJI.I (v,i,cu, iiiyh iTiOieCUiar Weighl 

nuclear DNA from any source whatsoever may be treated briefly with an 
exonuclease to Cie£.ta olunt ends. (Source DNA may be from prokar- 
yotes, eukaryotes, or viruses which infect prokaryctas or eukaryotes, 
as previously detailed.) The blunt ends of the fragmented DMm are 
then ligated to 5 small E. coli plasmid cloning vector, for example, 
which has been previously digested with two different restriction 
endonucleases to generate one blunt end and one scicky end. The 
ligation products are then digested with the restriction endonucle- 
ases, leaving comp?'-.ible ends. Next, the restriction fragments are 
ligated to form circles which are then propagated in an E. coli host, 
for example. 

Two independent primary libraries are constructed in this 
fashion; one using pSDC12, for example, and the other using the com- 
plementary vector, pSDC13. 

The primary libraries will contain inserts corresponding to 
sequences that lie adjacent to the ends of the large genomic DNA 
fragments. Since chromosome size DNA cannot be isolated, only a small 
fraction of the inserts are derived from telomeric ends; the remainder 
will reprc-ser.t sequences that lie next to random, shear-generated 
ends. Thus, tnn ra^io of tpln^nnV tn r-^r^r'nr^ . f.^..^^;^,, 
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r^eric clone can be obtained at a frequency of 10'^ in each of the 
primary libraries. Each library should be composed of about 5000 
members in order to ensure that each primary library has at least one 
telomeric clone. 

To increase the representation of telomeric clones, plasmids 
containing inserts that are present in both primary libraries must be 
Identified and saved, and random clones, isolated from broken en-^s, 
discarded, to form a new collecticr. of clones rafarred to 3i, the 
secondary library. This procedure will save telomeric clones in which 
the primary libraries were larne enough to ensure that each containea 
2t least one telomeric clone. 

Although methods for the enrichment of desired CNA sequences 
prior to clonir.g ar e known (see, for example, Tilghman et a1 . . Proc. 
Natn Acad Sci. USA 24:4406-4410 (1977)): the novel enrichment pro- 
tocol of the prf^sent Invention capitalizes on the underrepresentaticn 
of ncrjtelcmeric sequences in the two primary libraries. Random, non- 
telomsric single copy sequences will not be represented in both 
primary libraries due to the small insert size (average =■ 0.3kb) and 
the small mem.ber size (about 5,0C0/prinary library) of the primary 
libraries. Each primary library can represent, at most, only 15,000kb 
(5,000 X 0.3kb) of a 70,OUOkb haploid genome. Because the probability 
of cloning any particular DNA sequence independently from random 
chromosomal ends in both primary libraries is quite low, ncntelomeric 
sequences will be underrepresented and enrichment for telomeric clones 
can thus be achieved by saving clones with inserts corrjnon to both 
primary libraries (see Table 1). 

To as'emhlp a <;prnnrlarv lihraru c i nnl q_ v-clr^r^^^rl nMA 

first made by infection of both primary libraries with an M13 helper 
phage, for example By utilizing plasmids with polyl inkers oriented 
in opposite directions, infection with the H13 bacteriophage can 
separate the doubl e- stranded ONA into two sets of connl enient arv 
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and allowed to anneal. Since the M13 system packages only one strand 
of the plasmid, the vector sequences can not hybridize to themselves. 
A plasmid can only cross-hybridize to another single-stranded plaomid 
if the inserts of these plasmids are homolcgous and represent comple- 
mentary strands. 

The reassociation of two complementary sequences of DNA occurs by 
base pairing. Renaturation of DNA depends on random collision of the 
complementary strains; it therefore follows second order kinetics. 
This means that the rate of reaction is governed by the concentration 
of DNA that is single-stranded at any particular time, ana the time of 
incubation. In a preferred embodiment, the concentration of such 
single-stranded plasmids will te 400 /ig/ml . 

The single-:,LrandeJ plasmids with complementary DNA inserts 
cross-hybridize to form a double-stranded DNA section and the single- 
stranded plasmids v.'ithout such cross-hybridizing inserts remain as 
single-strandad DNA molecules. Hydroxylapatite chromatography, which 
distinguis.hes between single- and double-stranded DMAs, can be used to 
purify cross-hybridizing single-stranded plasmids (see, for example, 
Britten, et al . , Methods Enzvmo l . 29:363 (1974)). After isolation, 
the cross-hybridizing plasmids may be transformed into j recombina- 
tion-deficient E. co l i host, for example. 

The secondary library is thereafter screened in order to identify 
the clones that contain the specific hybrid DNA, or telomeric insert, 
desired. The screening of restriction fragments may be accomplished 
by a variety of methods; for example, gel electrophoresis, Southern 
blotting, buoyant density, etc. In a preferred embodiment, this 
screening for telomeric sequences may be correlated to ^xcnuclease 
sensitivity. Telomeric DNA sequences are preferentially sensitive to 
exonuclease attack since they always reside next to a DNA end. (See, 
for example, Yao, et al . . Proc. Natl. Acad. Sci . USA 78:7436-7439 
(1981)). Nontelomeric sequences are degraded by exonuclease treatment 
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In a preferred embodiment, the screen may be conducted using 
Southern blots (see Southern, J. MoT . Biol . 98:503 (1975)) prepared as 
follows. Source nuclear DNA can be digested with an eXcr.Liclease for 
various lengths of time (see Figure 2) and subsequently digested with 
a restriction endonuclease, size-fractionated by agarose gel electro- 
phoresis and transferred to nylon membranes. These Southern blots may 
then be probed with plasmid DMAs (pooled five at a time) prepared from 
clones in the secondary library. Plasmids which hyoridize to restric- 
tion fragments which shift electrophoretic mobility upon increasing 
exonucleolytic digestion are telomeric clones. 

Using this cloning and enrichment method, a telomeric clone which 
hybridized to exonuclease -sensitive restriction fragments was isolated 
from A. thalian ^. T^'s plasmid, designated pAtT4, was discovered to 
be described by the DNA sequence 5' -[CCCTAAA] -3' (see Example 5). A 
variant repeat was also found consisting of tandemly repeated blccks 
of the base sequence 5' - [CTCTAAA] -3 ' . The pAtT4 plasmid was deposited 
under the terms of the Budapest Treaty at the American Type Culture 
Collection, Rockville, Haryland^ before tue filing date of this 
application, and assigned Accession Number ATCC 67577. It is 
described more fully in Figure 4. 

III. A UTONOMOUS REPLICATING SEQUENCE 

Prior attempts to isolate functional higher eukaryotic replica- 
tion origin"? have been largely unsuccessful. Some have attempted to 
isolate these ARSs by selecting for ARS function in yeast (see, for 
example, Ross et al . . Hoi . Ce 11 . Biol . 3: 1898-1908 (1983); Stinchcomb 
et al. . Proc. Natl. Acad. Sci . U SA 77:4559-4563 ( 1980)). Unfor- 
tunately, these "ARS" sequences do not function to support extra- 
chromosomal replication whan introduced back into the natural host 
organism (:ee Jongsma nt al . . PI ant Hoi . Biol . 8:383-394 (1987)). The 
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replication origins relies on functional tests for extrachromosomal 
replication in plant cells. 

The present invention discloses a novel method for isolatiriy the 
ARS sequences which involves the formation of minichromosome deriva- 
tives of natural chromosomes (see Figure '/) . It has been demonstrated 
in yeast that inverted repeats of telomeric sequences are "reso'ived" 
by an unknown mechanism which results in a double-stranded cleavage 
between the inverted repeats (see, for example, Szostak, r.o'\d S prii:'^ 
Ha rbor Symp. Quant. Biol . 47:1187-1193 (1983)). After an inverted 
telomere repeat is introduced into a chromosome, a resolution reaction 
will lead to scission of tho chromosome and formation of two chromo- 
somal fragments, each wiih two telomeres (see Hurray et a1 . . Ce"!! 
45:529-536 (I98f)). This process generates a minichromosome small 
enough t.o be isolated intact (<lMb) allowing further manipulation by 
in vitro techniques to delimit the sequences responsible for autono- 
mous replication. Example 11 details illustrative experimental proce- 
dures to be followed. 

A second novel method disclosed by the present invention for 
isolating functional plant origins is a shotgun cloning approach (see 
Figure 8). Higher eukaryotic organisms have many replication origins 
distributed throughout their genomes. The A. thaliana genome contains 
approximately 1000 origins spaced every 70 kb along the chromosome 
(see Van't Hof et al . . Chromosoma 58:269-285 (1978)). Therefore, it 
is quite reasonable to look for random fragments of genomic DNA from 
A. thaliana which oromote extrachromosomal replication. The details 
of performing such a shotgun cloning strategy are presented in Example 
12. 
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IV. CENTROMERE 

As previously discussed, a centromere is the pr-,mar> ccr.str iction 
of the chromosome and the site of kinetochore assembly (see Llackburn 
et a"*. . Ann. Rev. Biochem. 53:163-194 (1984); Clarke et al . . Ann. Rev. 
Genet. 19:29-55 (1985)). The kinetochore is the physical structure 
that mediates the attachment of the spindle fibers to the chromosome 
and is therefore responsible for the proper partitioning of th'^ chro- 
mosomes at mitosis and meiosis. The DNA sequences found at the cen- 
tromere (CEN sequences) presumably play a role in specifying 
kinetochore assembly, 

CEN sequences have Lasn isolated from S. cerevisi^e and Schi z o- 
saccharonyc^s ounine. The functional CEN sequences of S. cer&visiae 
are less than 2G0 bp long and consist of three conserved sequence 
elements (see Clarke, supra) . ine functional CEN sequences in 3 . 
pomba have not yet becin defined precisely (see Nakaseko et al . , EMBQ 
5:1011-1021 (1985)) . 

CEN sequences from a higher eukaryotic system have not been 
isolated. In contrast to S . cerevisiae yeast kinetochores , which 
cannot be seen under the microscope, kinetochores from higher eukaryc- 
tes are extremely largo (up to 0.7 microns), indicating that higher 
eukaryotic CEN sequences constitute long stretches of DNA (see, e.g. , 
Bloom et al . . C ell 29:305-317 (1982); Ris et al . , Chromosoma 82:153- 
170 (1981); Sakai-Weda, A., Cvtolcgia 48:253-258 (1983)). 

The cloning strategy for isolation of plant CEN sequences used in 
the present invention capitalizes on the availability of cloned telo- 
meric sequences and telotrisomic mutants of A. thaliana that possess a 
telomere in a new chromosomal location (sea Figure 9). 

Telotrisomics are a special type of trisomic mutant that contain 
an extra chromosome arm in the form of a telocentric chromosome. 
Kocrnneef et a 1 . . Gr^nnUca 61:41-46 ( 1983) iholatpd Tr5A, the telotri- 
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arise from primary trisomic lines as a result of breakage of the extra 
chromosome when it exists an unpaired univalent during meiosis; the 
breakage is thought to occir at, or close to, the centromere (Dar- 
lington, J. Genet. 37: 341 (1939)). The telocentric is formed by a 
"healing" event which leads to addition of a new telomere at the site 
of breakage. 

A telotrisomic mutant has two extra telomeres in each ncleus: 
1) the telomere on the distal tip of the extra chromosome arm which is 
identical to the telomere on the corresponding full length chrorrosome 
and 2) a new telomere adjacent to the centromere which has no codntc"-- 
part in the normal cn.-omosome complement. 

The new IfiloT.ere in the telotrisomic mutant Tr5A may be detected 
as a restriction fragment length polymorphism (RFLP) on Southern blots 
comparing telotrisomic genomic DNA and wild type disomic genomic CNA 
which have been probed with the cloned A. thai iana telomere (see 
Example 7) . 

In a preferred embodiment, once a telotrisomic RFLP has been 
identified, a genomic library may be constructed to clone the restric- 
tion fragment representing the CEN-telomere fusion (see Example 8). 

Several tests can be used to determine if the clones isolated by 
this procedure represent A. thaliana CEN sequences. First, the puta- 
tive CEN sequences are used as hybridization probes on Southern blol-; 
of all known telotrisomic mutants. If the probe represents authentic 
CEN sequences, it recognizes different RFLPs in each telotrisomic 
mutant, corresponding to the different CEN-telomere fusion events. 

Functional assays for CEN sequences may also be used. ThG 
development of acentric autcnomously replicating plant vectors, pre- 
viously discussed, effectuates assays of potential CEN sequences for 
partitioning function. The acentric vectors are lost at a high fre- 
quency in the absence of selection due to the nondisjunction of repli- 
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stable maintenance of the extrachromosomal vectors (see Hsiao et a1 . . 
Pro c. Natl. Acad. Sci. USA 78:3760-3764 (1981)). 

Another functional test for CEN activity which may be us^d in the 
present invention is the creation of dicentric chromosomes (see Mann 
et a1. . Proc. Natl. Acad. Sci. USA 80:228-232 (1983)). Dicentric 
chromosomes are broken during anaphase when the two centromere 'kineto- 
chores become attached via spindle fibers to opposite poles. The 
breakage often leads to loss or rearrangement of the effected ch'-omo- 
some. To perform a dicentric assay, putat.ve CEN sequ £ M c s s can be 
integrated into plant chromosomes using the Acrobacteri um/ T-Di,'A 
system. The introduced sequences which function as a centromere cause 
a dicentric chromosoH'^ i-o be created. This chromosome undergoes 
breakage, fo''lowed by rearrangement or chromosome loss; these Gvents 
can be detected by karyotype analysis (i.e., microscopic examination 
of the chromoscmes) and/or the loss of marker genes introduced along 
v-fith the CEN seq^iances. 

A different sxperiiTiental approach to assaying putative CEN 
sequences involves charh-^tenzation of proteins which bind to these 
sequences. Such proteins can be purified by affinity chromotography 
using the putative CEN sequences as a binding substrate (see Kadcnaga 
et a1 ■ , Proc. Natl . Acad. Sci. USA 83:5889-5893 (1986)). Antibodies 
raised against the proteins are then used to localize the antigen on 
the chromosome using in situ techniques. Authentic CEN bindin^j pro- 
teins are thus localized at the centromere. 

V. CONSTRUCTION OF CHROMOSOMES 

Having now disclosed novel recombinant molecules and methods for 
preparing them so as to obtain functional essential chromosomal ele- 
ments, it will be possible for those of ordinary skill in the art to 
construct such artificial chromosomes. Useful construction methods 
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tion to the essential elenients, a selectable plant marker (e.g., 
antibiotic resistance gene), and a cloning site for insertion of 
foreign DNA must be included. In order to propagate the vectors in E^. 
col i . it is necessary to convert the linear molecule into a circle by 
addition of a stuffer fragment between the telomeres, as indicated in 
Figures 8 and 10. An E. coli plasmid replication origin and selec- 
table marker must also be included. 

Artificial plant chromosomes which replicate in yeast may also be 
constructed to take advantage of the large insert capacity and stabil- 
ity of repetitive DNA inserts afforded by this systeni (see Burks et 
al . . Science 236:805-812 (1987)). In this case, yeast ARS and CEH 
sequences are added to the vector. The artificial chromosor.e is 
maintained in yeast as - circular molecule using the stuffer fragment 
to separate the telomeres. 

VI. PLANT CELL TR A NSFORMATION 

Deproteinized DNA nay be introduced into plant cells by the 
following several methods known in the art. For example, Shillito gt 
al-, Bio technolog y 3:1099-1103 (1935) detail a protocol for the stable- 
transformation of plant protoplasts at frequencies of up to 2% using 
electroporation. 

Other methodologies are also useful in the present invention. 
Microinjection techniques are disclosed in Crossway et al . . Hoi. Gen. 
Genet. 202:179-185 (1985). Microprojecti 1 e delivery may also be used 
(see Klein et al . . Nature 327:70-73 (1987)). 



Having now fully described the present invention, the same will 
be more clearly understood by reference to certain specific examples 



EXAMPLES 
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The following experimantal parameters were utilized in all man- 
ipulations. 

Plasmids and Probes 

Poly d(CA/6T) was purchased from Boehringer Mannheim Biochemi- 
cals. pSDClZ and pSDC13 are described in Levir.son et al . . J. Mol . 
Appl. Genet. 1:507-517 (1984). pKDRI and pKDR2 contain the joybean 
rRNA genes on 3.75 and 3.9kb EcoRI fragments respectively (Lchenrcde 
et a1.. J. Hoi. Evol. 11:259-269 (1935)). pARRIS ard pARR12 contain 

2.5kb and 180bp A.^ thai iana EcoRI fragments consisting of rONA and 

rDNA spacer DNA clcne-J in pUC12 respectively; pARR20-l contains a 
180bp HindlH fr£g..,enl corresponding to the 180bp tande.-ii repeat family 
described by Hartinez-Zapater et al . . Hoi. Gen. Genet. 204:417-423 
(1986), cloned in pUr.l2; and X1.4DNA is a Xenopus rDNA clone (Maden et 
iL., Nuc l. Acid Res. 6:817-830 (1979)). All DNA was prepared by 
standard procedures (Ausubel et al. . Current Protocols in Mo1:;cui ar 
Biology (New York: Jonn Wiley & Sons) (1887)). Radiolabeled probes 
were prepared either by nick translation or the random priming method 
using kits purchased from Boehringar Mannheim Biochemical s. 

Preparation of Genomic DN'As 

A. thaliana nuclear DNA (average size of approximately lOOkb) was 
prepared as described by Hamilton et a1 . . A nn. Biochem. 49:48 (1972). 
Genomic DNA from the A. thaliana Landsberg telotrisomic mutant Tr5A 
and its disomic siblings (Koornr.eef et a 1 . . Genetic a 61:41 (1983)) was 
prepared by the miniprep procedure of Dellaporta et al . . PI ant Mol . 
Bio I. Rptj;:^ 1:19-21 (1983). Genomic DNA preparation from all other 
plant species was performed as described in Ausubel et al . , supra 
(1937). Hu-an DNA was preoared by lysis of monolayers of HeLa cells 
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were phenol -extracted twice and chloroform-extracted once. The 
genomic DNA was then collected by ethanol precipitation. 

Source of Enzymes 

All restriction enzymes were purchased from New England Biolabs, 
Boehringer Mannheim Biochemical, or International Biotechnol cgia? , 
Inc. Bal31 and T4 DNA Ligase were purchased from New England Biol Lbs. 

Example 1 
Construct icn of Primary Library 

High molecular weight genomic DNA was prepared and treated with 
Qal31 exonuclease to remove approximately 10 bps from every double- 
stranded end. The exonucleass digestion ensured that Bal31- 
susceptible terminal structures at the telomere (such as a hairpin) 
would ht removed and tha^. a blunt end suitable for ligation would be 
created. 

The Bal31-treated blunt ends of the genomic DNA were ligatsd 
independently to the plasmid cloning vectors pSDC12 and pSDC13 that 
had earlier been digested with Bam.HI and Hindi (which cleave in the 
polyl inker, leaving one blunt Hindi end and one sticky BamHI end) to 
generate linear vector uNAs with non-coripatible ends. The ligation 
products were then digested with BamHI and Kboi to generate linear 
chimeric DNAs with compatible ends (Mbol, which recognizes the 
sequence 5'-GATC-3', cleaves genomic DNA frequently (about every 300 
bp) leaving BamHI compatible ends; Hbol does not cleave within the 
vector sequences because pSDClZ and pSDC13 were purified from a dam"'' 
host). The linear chimeric DNAs were circularized by ligation at low 
DNA concentration and transformed into an F"*" recA E. col i host (JH109) 
' - f-^y-"i op,-! ncpri'' rirn p^nr]^ n*'im^ry librnrv. 
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DNA concentration of 10 pg/ml at 30*C as described below. It was pre- 
viously determined that under these conditions the rate of Bal31 
exonucleolytic "digestion was approximately 15 bp/miri/end. pSDC12 and 
pSOClS were digested separately with Hindi followed by digestion with 
BamHI; the DMAs were precipitated with spermine and the 5' phosphates 
were removed with calf intestinal phosphatase. (The phosphatase step 
was omitted when constructing the yeast primary libraries, resulting 
in a high number of plasmids without inserts.) Ihe linearized plasmid 
DNAs were ligated to the BalSl-treated yeast or A. thaliar.a genomic 
DNAs with T4 DNA Ligase at 16'C using 20,000 U/ml of enzyme, a 10 foV-' 
molar excess of wpctcr to uancmic DNAs, and a DfiA concentration of 130 
/xg/ml . The ligaticf! products were digested with Mbol and BamHI and 
circularized with 74 DNA Ligase at 4*C using 80 U/ml of enzyme at a 
DNA concsntration of 1.3 «g/ml . Transformation of the circularized 
DNAs into competent JM309 cells (Strat-agene) generated two primary 
libraries for both A. thalia na and yeast. 



Colonies representing both primary libraries (yeast primary 
libraries contained 474 (pSDCl?) and 541 (pSDC13) colonies (see Table 
1); A. thaliana primary libraries contained 5500 (pSDC12) and 5700 
(pSDC13) colonies) were scraped from agar plates, grown en masse in 
liquid culture, and infectpd at a multiplicity of infoction of 10 with 
the H13 helper phage rvl (Levinson et a1 . , J. Hoi . App. Genet. 2:507- 
517 (1984)). The infected cultures were diluted 100-200 fold, grown 
10-14 hr at 37*C, and phage particles were purified from the super- 
natants as described in Levinson et al . , suura (1984). Phenol extrac- 
tion and ethanol precipitation of the purified phage yielded single- 
strand n rl n N' A " p " r r- " r, t; -j fi g Q 2 c h p T i !T! 2 r y library. 



Example 2 

Assembly of Secondary Telomeric Libraries 
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anneal in 25 mM phosphate buffer pH 6.8 at 42'C for 3 hr (approxi- 
mately 10 X Coti/2 for the A. thaliana primary libraries). Cross- 
hybridizing single-stranded DNA molecules were isolated as partially 
double-stranded DNAs using hydroxyl apatite chromatography as follows: 
0.2 g of hydroxyl apatite (BioRad HTP) hydrated in 100 mM K+ phosphate 
buffer, pH 6.8, and heated to lOO'C for 30 min (Martinson et al . , 
Anal. Biochem. 61:144-154 (1974)) was equilibrated in 25 mH phos- 
phate buffer, pH 6.8, and poured into a water- jacketed column (0.7 :;n 
inner diameter) that was maintained at 42'C. The DNA hybridization 
solutions were diluted into 0.5 mi of 25 mH K+ phosphate buffer, pH 
6.8, and applied to Iho Cjlu-n. Single-stranded DNA was elated by 
washing the column with o ml of 100 mM phosphate, pH 6.8, at 42*0 
followed by 3 ml of the same buffer at 50"C. Partially double- 
stranded DNAs were eluted from the column with /'.-4 ml of 20Q mM K"^ 
phosphate buffer^ p'i 5.8, at 50*C. The fractions contain^.ng the 
partially double-stranded plasmids were concentrated by sec-butanoi 
extraction followed by a chloroform extraction; the phosphate buffer 
was exchanged with 10 mM Tris-HCl pH 8.0, 1 miM EDTA, 100 mM NaCl using 
Sephadex G-50 (Pharmacia) spin columns, and the DNA was ethanol -pre- 
cipitated. Secondary libraries wore formed by transforming HBlOl with 
the purified cross-hybridizing single-stranded plasmids using the 
protocol developed by Michael Scott (personal communication). 

The A. thaliana secondayy library was screened to identify 
abundant non-telomeric repetitive DNA clones by colony hybridization 
(described below) using a mixed proba containing the inserts of 
pARRlZ, pARR20-l, pKDRl , and pKDR2. Colonies which did not hybridize 
to this mixed probe were savad in ordered arrays in 96 well microtitsr 
dishes at -80*C in LB media containing 15% glycerol. 

For characterizations of the yeast primary and s?-. :ndary li- 
braries, colony hybridizations were performed using nitrocellulose 
T ^ o y-n i r-r, T n o p r o T T 1 'i ^ fit Gnipstpln pt qI., PNAS USA 
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sulfate, IX P Buffer (New England Nuclear, Gene Screen protocols), and 
25 /ig/ml E. coli DNA at 60-65*C. The filters were washed in 2X SSC, 
0.5% SDS at 50-55'C. The A. thaliana secondary library was griddad 
out on large (22 X 22 cm) agar plates and transferred to uncharged 
nylon membranes (BioTrans, ICN). The colonies were lysed by autoclav- 
ing the membranes for 1 min, and the hybridizations (using pAtT4 
insert probe) were carried out at 55*C using 1 H NaCl , 10% dextran 
sulfate, 1% SDS, 25 /ig/ml tRNA, and a probe concentration oV 10^ 
cpm/nl. The filters were washed at 60*C with 0.2X SSC, 0.1% SDS. 



To screen the A. thai i ana secondary library, liquid cultures 
inoculated from tlie microtiter dishes were pooled in groups of 5-5 and 
the plasmid DNAs purified by the alkaline lysis miniprsp procc'Jure 
(Ausubel et al . . su^jra (1987)). Tha plasmid pools were radiolabeled 
by nick translation using --P-dCTP (New England Nuclear) and used to 
probe genomic Southern blots (Haelll digests) of A. thaliana nuclear 
DNA which had been previously digested with Bal31 nuclease for 0, 7, 
and 30 min under the following conditions: 

High molecular weight A - thaliana nuclear DNA was digested with 
0.25 U/ml Bal31 nuclease at 30'C at a DNA concentration of 10 /(g/ml in 
12 rr-M CaC;2, 24 mM HgCl2, 0.2 M NaCl, 20 mH Tris-HCl pH 8.0, 1 mV. 
EDTA, and 100 /ig/ml BSA. Moui f ications or these parameters usf d rot 
Bal31 digestion of Z. mays and human genomic DNA are indicated in the 
appropriate figure legends. Reactions were stopped at the indicated 
tim!is by addition of EGTA to a final concentration of 20 mH, extracted 
with phenol and cnloroform; and precipitated with ethanol . 

Southern blotting zr.d hybridization v.'crc carried cut with nylon 



Example 3 



Sc/eering of Secondary Teloir.eric Libraries 
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hybridlzations were Jone 1n 0.5 H NaH?04, pH 7.;'., 7% SDS, 1 mM EDTA, 
and 1% BSA (Church and Gilbert, supra (1984)) at 50'C using a probe 
concentration of 10^ cpm/ml . Filters were washed under the follc.ing 
conditions: Low stringency - 2X SSC, 1% SDS at 60'C; high stringency 
- 0.2X SSC, 0.1% SDS at 60*C. 

In this case, the blots were washed under in 2X SSC, 1% SDS at 
55*C. Plasmid pools that hybridized to Bal31 nuclease-sensitive bands 
or bands that ran at the limit of mobility were split into individual 
clones to identify potential telomeric clones. 



To identify A. th a liana telomeric clones, the screening procedure 
described in Example 3 was used. One clone, designated pAtT4. was 
found which hybridized to exonuclease-sensitive restriction fragmentr. 
This plasmid was isolated as a stable derivative of a larger p1<iSMid 
that suffered spontaneous deletions. 

Figure 2 shows a Southern blot of Bal31-treated A. thaliana 
nuclear DNA which was probed with radiolabeled pAtT4. The lanes at 
the left in Fig. 2A contain Bal31-digestGd nuclear DNA which was 
subsequently cut with Mbol. In the non-Bal31-treated lane, pAtT'^ 
hybridized to a smear representing restriction fragments ranging from 
2.0 to S.Okb in length. The size and intensity of the heterodi sperse 
fnbol band decreased upon increasing exonuciease treatment. The rate 
of exonuciease digestion was linear and approximately 2.5kb of 
sequence had to be removed to eliminate most of the pAtT4 hybridiza- 
tion. 

The righthand lanes in Fig. 2A show the hybridization pattern of 
pAtT4 to 3a-H; -digsstcd GalBl -treated genomic DhA. In this case, the 
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Identification of pAtT4 Telomeric Insert 
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hybridized to a coVicCtion of restriction fragments that were hetero- 
geneous in size. The BamHI bands become smaller and eventually dis- 
appeared with increasing Bal31 digestion. 

To demonstrate that the exonuclease sensitivity of pAtT4 homolo- 
gous saquences was not a property of genomic sequences in general, the 
filter shown in Fig. 2A was reprobed with A. thaliana ribosonal DNA. 
The results (Fig. 2B) show that the rRNA gene probe hybriciized to 
several bands, none of which shifted mobility due to exonuclease 
treatment. 

Exampig 5 

St. .juencirig of pAtT4 Telomaric Insert 

The pAcT4 insert was sequenced by the dideoxy method using 
clGuble-strandsd teniplate following the protocols described in Ausubel 
9t al. , sujDra (1987). The simple sequence insert template caustd 
pausing of the reverse transcriptase, making it difficult to read 
sequence far from the primer; for this reason, both strands of the 
insert were not sequenced in their entirety. The sequence of tho 
telonisric repeats w=s confirmed by sequencing pAtT4 using the chemical 
method of Maxam and Gilbert, Meth. Enzypol . 65:4SS-5SQ (1980). 

The DNA sequence of pAtT4 insert is shown in Fig. 4. The insert 
is composed almost entirely of tandemly repeated blocks of the se- 
quence 5'-[CCCTAAJ\]-3' . Two variant repeats, 5'-[CTCTAAA]-3' , were 
found in tandem in the middle of the insert. The simple sequptico 
repeated DNA abuts the blunt end cloning sits of the vector (noted as 
1/2 Hindi in Fig. 4); this end of the insert corresponds to the 
terminus of the chromosome. It should be kept in mind, however, that 
since the genomic DNA was initially treated with Bal31 to remove a 
small number of nucleotides from the ends, before the cloning was 
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From the telomeric end of the insert, the tandemly repeated 
sequence extends 386 bp, comprising 55 complete repeat units. The 
orientation of the repeats is such that the C-ricn strand points 
towards the centromere when read in the 5' to 3' direction. In addi- 
tion, tho insert contains 17 nucleotides of a nonrepeat flanking 
sequence which includes the Hbol site used to clone the terminal 
restriction fragment. 

The discrepancy between the size of the pAtT4 insert (403 bp) -nd 
the teiomeric Hbol restriction fragments detected on Southern blots 
(2.5-5.0 kb) (see Fig. 2A) is most likely due to internal deletion of 
the telcmsric repeats durl^'g propagation of the plasmid. Consistent 
with this interpretati'", is the fact that pAtT4 was isolated as a 
stable derivative- oi a larger plasmid which suffered spontaneous 
deletions. Siniilar instability of simple sequence telomeric repeat 
clones from Trypanocctnas has been noted (see Blackburn et a1 . ,, Ceil 
36:447-457 (1984)). 

Ex ample 6 

Representation of pAtT4 Homologous Clones in the 
A. thai i ana Primary and Secondary Libraries 

The representation of telc.ieric clones in the A. thaliana primary 
and secondary libraries was determined by colony hybridization using 
the purified insert of pAtT4 as a probe. Approximately one out of 
5000 clones in each amplified primary library hybridized to the pAtT4 
insert probe (frequency of 2 X IQ-"*). Since each primary library was 
composed of approximately 5500 members, this result suggests that each 
of the original primary libraries contained a single telomeric clones. 
The secondary library, on the other hand, contained five out of a 
t-ot.^.l of 765 c'\^r\es which hybridized to the pAtT4 insert prob<_- 
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resulted in a roughly 30-fold enrichment for telomeric clones relative 
to the primary libraries. 

Example 7 

Isolation of Telomeric DNA Sequences from Yeast 
Using the Enrichment Strategy 

To determine the effectiveness of the telomeric cloning and 
enrichment strategy outlined in Figure 1, an experiment was undertaken 
to clonfi yeast telomsric sequences by this method. The results are 
presented in Table Ono. 







TABLE ONE 








Type of 
Library 


Vector 


Members 


% 

Inserts 


Members 

w/ 
Inserts 


# 

CA/GT 


Frsq 


Primary 


pSDClZ 


474 


50% 


240 


4 


0.017 


Primary 


pSDC13 


541 


55% 


350 


6 


0.017 


Secondary "A" 




164 


100% 


164 


28 


0.17 


Secondary "B" 




134 


nd 


<134 


27 


0.20 



The type (primary versus secondary) and size (number of member 
clones) of the yeast libraries are indicated. Secondary "A" and "B" 
denote two Independently assembled secondary libraries. The % insert 
was estimated from the study of 20 randomly selected clones (nd - not 
determined^. I CA/GT refers to the number of clones which hybridized 
to a radiolabeled poly d(CA/GT) probe, as determined by the colony 
hybridization method. The frequency of poly d(CA/GT) hybridizing 
clones in each library (Freq) was calculated by dividing # CA/GT by 
the number of membors with inserts. 
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The pSDClZ and pSDC13 yeast primary libraries contained 240 and 
350 clones with inserts, respectively. Approximately one out of sixty 
clones in both yeast primary libraries hybridized to a poly d(CA/(JT) 
probe that cross-hybridized to the yeast telomeric sequence (Ci.3A)n, 
as well as short stretches of {CA)^ found in the chromosome arms 
(Walmsley et al. . PNAS USA 82:506-510 (1983)). 

The following observations Indicate that most of the poly 
d(CA/GT) hybridizing clones contained telomeric sequence inserts: (1) 
9 out of 10 clones had the insert orientation predicted for cloning of 
telomeric restriction fragments (with asymmetric C-rich and G-rich 
strands) into the pSDClZ ?.nd pSDClB complementary vectors, (2) 
sequencing of one pSDC13 clone showed that the {Cj_3A)n sequence abuts 
the blunt end cloning site of the vector, (3) Southern blot experi- 
ments indicated that at least three clones contained inserts which 
cross-hybridized with telomere flanking sequences (data not shown). 

Two separate secondary libraries (designated A and 8) were 
assembled fr 'n the primary yeast libraries as described in Figure 1. 
As shown in Table One, the frequency of clones hybridizing with the 
poly d(CA/GT) probe was approximately 0.2 in both secondary libraries, 
indicating that a ten-fold enrichment in clones that hybridized to the 
poly d(CA/GT) probe was achieved. 

Example 8 

Identification of a Telotrisomic 
RFLP by the Telomeric Clone pAtT4 

To determine if pAtT4 would identify a telotrisomic RFLP, 
Southern blots of genomic DNA were prepared from telotrisomic mutant 
Tr5A and wild type disomic plants. Genomic DMA from the A. thaliana 

1 ! ► 1 : — ; - f ^ ^,-^prnir <;nhinns was prepared by 
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1:19-21 (1983). One feature of telotrisomic plants that is of par- 
ticular use in this experiment is that both disomic and telotrisomic 
plants appear in the progeny after selfing telotrisomic parents. 
Therefore, the telotrisomic progeny should be isogenic with their 
disoiric siblings except for the presence of the extra telocentric 
chromosome. Moreover, since the telocentric chromosome Is free to 
recombine with the full length chromosomes, and the telotrisomic 
stocks are maintained by selfing mutant plants, the only quaHtacive 
difference between the genomes of the telotrisomic and the disomic 
progeny should be the new telomere. 

Telotrisomic and normal disomic plants are distinguished on the 
basis of gross morphol,:jgy. Disomic plants are generally larger and 
have broad leaver, /hilc Tr5A telotrisomics are semi -dwarf plant with 
narrow and slightly serrated leaves. There is some overlap in the 
phenotypic norms; in particular, individual diploid plants that are 
growing slowly will occasionally be identified incorrectly as telo- 
trisomic. It is unlikely, however, that telotrisomic progeny will be 
mistaken for disomies. 

As shewn in Fig. 3B, pAtT4 hybridized to a 15 kb Dral restriction 
fragment in the Tr5A genome that is absent from the disomic genome. 
This novel restriction fragment corresponds to the functional telomere 
that was added next to the centromere. 

The experiment shown in Fig. 38 was repeated using DNA isolated 
from 14 individual progeny plants derived from selfing a single Tr5A 
parent. Of the 14 plants, 9 were identified as disomies; none of 
these plants contained the polyrr.crphic 15kb Dral r^arker idcntifitid by 
pAtT4. This result indicates that the RFLP was linked to the new 
telomere since this telomere is the only genetic marker that can not 
be crossed away from the telocentric chromosome onto the full length 
chromosomes. The remaining 5 plants were identified phenotypically as 
telotrisomics; cf thsss, 3 displayed the telotrisomic pattern defined 
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"telotrisomlc" plants which did not contain the marker were most 
likely Incorrectly phenotyped disomic plants and not a consequence of 
recombination between the new telomere and the polymorphic marker 
since such a recombination event was not detected from study of the 

disomic siblings. 

The correlation between the presence of the new telomere and the 
pAtT4 15kb Oral polymorphic marker, in conjunction with the exo- 
nuclease sensitivity of the pAtT4 homologous genomic sequences, arrues 
convincingly that pAtT4 hybridized to telomeric sequences in 
thaliana . 

Another feature of iha blot shown in Fig. 3B is of Interest. The 
heterodisperse bands in the Tr5A lane are slightly smaller than the 
corresponding bands in uhe disomic lane, indicating that the telomeres 
in the mutant are smaller. This size difference is not surprising 
since telomeres in lower eukaryotic systems are known to be dynamic 
structures which change length depending on the genetic background and 
growth conditions of the organism. 



Isolation of Plant Centromeres Using Cloned Plant Telomeres 

Once a telotrisomic RFLP is identified, a genomic library may be 
constructed to clone the restriction fragment representing the CEN- 
telomere fusion. Genomic DNA from the telotrisomic mutant is treated 
briefly with the exonuclease Bal31 to generate blunt ends at the 
telomere, as was done in the telomere cloning strategy (see Figure 1). 
The end-repaired genomic DNA is then digested with the restriction 
endonuclease which defines the RFLP, and the population of DNA mole- 
cules corresponding to the size of RFLP isolated.' The size-selected 
DNA is cloned into a circular episomal yeast vector that carries a 
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Burke et aK . Science 2M:806-812 (1987)). The desired CEN-telomere 
clone may be identified in the library by hybridization with the 
isolated A. thaliana telomere. Alternatively, an L._£oli vector hosl 
system can be used. 

Example 10 

Homology Between A. thaliana Telomere Clone and 
Telomeric Sequences in More Complex Plants 

To determine whether the telomeric repeats from A, thaliana were 
homologous to telomeric DMA sequences in other higher plants. Southern 
blots of genorric DN* were prepared from a variety of plant species. 
Genomic DNA preparation from all plant species other than A. thaliana 
was performed as described in Ausubel et al . . In: Cqrr, Prot. Molec. 
Biol, , pp. 2.'3 (New York: John Wiley i Sons) (1987) and incorporated 
herein by reference. These DNA sequences were probed with radio- 
labeled. pAtT4 and washed at high stringency (see Ausubel, ?upra) . The 
probe hybridized to genomic sequences from all plants tested including 
Brassica camoestris . Brassica carinata . Ranhanus sativus (Radish), 
LvcooerslcQn esculentum (Tomato), Nicotiana tabacum (Tobacco), 
Hedicaoo sativa (Alfalfa), and Zea mays (Corn). All of these plant 
species are dicots with the exception of the monocot I, mays. 

A. thaliana is a dicot and Z. mays represents an evolutionarily 
diverged plant. Thus, Z. mays represented an excellent test vehicle 
for determining the applicability of the teachings of the presant 
invention to higher plant species. 

To determine if the pAtT4 cross-hybridizing sequences in the Z^ 
mays genome were located at the telomeres, high molecular weight Z^ 
mays genomic DNA was treated with Bal31 for various lengths of time, 
digested with the restriction endonuclease Bell, size fractionated, 
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The pAtT4 hybridizrtion pattern Includes several bands larger 
than 14kb and a collection of incompletely resolved heterodisperse 
bands of lower molecular weight. The bands become less intense and 
display increased electrophoretic mobility upon increasing Bal31 
treatment. A faint 8kb band that did not change in size during 
exonuclease digestion can also be seen in Fig. 5A, demonstrating that 
sensitivity to Bal31 digestion is not a property of all genomic 
sequences. 

This experiment indicates that A. thaliana telomeric repeats 
cross-hybridize to telomeric sequences in Z. mays . 

Example 11 

Cross-Hybridation of A. thaliana Telomere Clone to 
Telomeric Sequences in the Human Genome 

Southern blots of the genomic DNA prepared from some well -studied 
animal model systems were probed with pAtT4 to determine if the A^ 
thai iana telomere repeat would cross-hybridize to animal telomeres. 

Human DNA was prepared by lysis of monolayers of HeLa cells with 
100 mM Tris-HCl pH 8.0, 50 mH EDTA, 500 mM NaCl , 1% SOS, and 100 /ig/ml 
Proteinase K. After incubation for 1 hour, the lysates were phenol - 
extracted twice and chloroform-extracted once. The genomic DNA was 
then collected by ethanol precipitation. The A. thaliana telomere 
probe hybridized to both human and Caenorhabditi s eleqans DNA under 
low stringency conditions, but pAtT4 did not hybridize to the D roso- 
p hila melanoqaster genome. 

To test if the pAtT4-cross-hybridizing sequences in the human 
genome were telomeric, pAtT4 was used to probe a Southern blot 
(Hindlll digest) of Bal31-treated human DNA. Figure 6A shows that 
nr.tT4 hvHridi/pH t n ^ hptprodi spPTSP band ranainn from 8.5 to 15kb, in 
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app^ared with increasing exonuclease digestion. Two faint bands of 
2.3 and 3.5kb are also seen in Figure 6A, which were not affected by 
the exonuclease treatment. 

As a control, the filter shown in Fig. 6A was stripped and 
reprobed with a Xenopus rDNA clone which cross-hybridized to human 
rRNA genes. Restriction fragments carrying the human rRNA genes do 
not change mobility during the exonuclease treatemtn, as shown in Fig. 
6B. 

These data suggest that most of the pAtT4 cross-hybridizing 
sequences of human DNA are telomeric. 

Example 12 

First Method of Isolation of Autonomous 
Replicating Sequence 

An inverted telomere repeat is integrated into an A. thali^na 
chromosome using the Agrobacterium/ T-DNA transformation system. Genes 
for different drug resistances are placed on either side of the telo- 
mere palindrome. After integration of this construct into an ^ 
thai ia na chromosome, a resolution reaction generates two chromosome 
fragments, both of which can be selected for by requiring that trans- 
formants to be resistant to both drugs. Since the average A. thaliana 
chromosome is only 20Mb long, random integration of the telomere 
palindrome leads to the formation of ^n isolatable (<lMb) acentric 
minichromosome In approximately 10% of the transformation events. The 
isolated minichromosome 13 further fragmented using in vitro cloning 
techniques and assayed for extrachromosomal replication in plant 
cells. 

In this way, the sequences comprising a functional plant replica- 
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Example 13 

Second Method of Isolation of Autonomous 
Replicating Sequences 

As a first step, a plasmid cloning vector is constructed with the 
following features: i) a plasmid replication origin and a selectable 
marker that functions in E. coli . ii) a selectable marker that func- 
tions in plant cells, iii) a cloning site(s) for insertion of plant 
genomic DNA fragments, and iv) an A. thaliana telomere inverted repeat 
separated by a stuffer fragment flanked by NotI sites (the stuffer 
fragment permitted replication of the plasmid as a circle in E. coll ). 

An A. thaliana genomic library is then constructed for this 
cloning vector and maintained in an f- coll host. Plasmid DNA pre- 
pared from this library is digested with the restriction endonuclease 
Not I to generate linear molecules with two telomeric ends. These 
linear DNA are then introduced into plant cells by direct DNA trans- 
formation (see Potrykus et al . . Plant Ho1 . Biol. Rptr. 3:117-128 
(1985)) and the transformants selected. 

Introduction of the DNA into plant cells as linear molecules with 
telomeric ends reduces the frequency of random integration into the 
chromosomes caused by recombinogenic, broken DNA ends formed during 
transfection (see, for example, Perrot et al . . Mol . Cell Biol . Z:1725- 
1730 (1987). Since extrachromosomally replicating DNA molecules 
without centromeres are mitotically unstable, screening for transfor- 
??.5nts which lose the marker under nonselective growth is required (see 
Hsiao et a1 . . Proc. Natl. Acad. Sci . USA 78:3760-3764 (1981)). Once 
unstable transformants are identified, extrachromosomal DNA is pre- 
pared to recover the linear minichromosomes . The ARS sequence 
responsible for extrachromosomal replication is then identified and 
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Examole 14 

Construction of Artificial Plant Chromosomes 

Plant artificial chromosomes are constructed by combining the 
previously isolated essential chromosomal elements. Various configur- 
ations of assembled artificial chromosomes are described below for 
purposes of illustration (see Figure 10). These artificial chrome- 
scfr.es designed to be "shuttle vectors"; they can be maintained in 
a convenient host (either E. coll or yeast) as well as plant cells. 

An artificial chrcitoso.'ne can be maintained in E. coli as a 
circular molecule by placiisg a removable stuffer fragment between the 
telomeric sequence bljcks (see Figure 10(A)). The stuffer fragment is 
a dispensible DNA sequence, bordered by unique restriction sites, 
which can be removed by restriction digestion of the circular ONAs to 
create linear molecules with telomeric ends. In addition to the 
stuffer fragment and the plant telomeres, tha artificial chromosome 
contains a replication origin and selectable marker that can function 
in E. coli to allow the circular molecules to be maintained in E. coll 
cells. The artificial chromosomes also include a plant selectable 
marker, a plant centromere, and a plant ARS to allow replication and 
maintenance of the DNA molecules in plant cells. Finally, tha 
artificial chromosome includes several unique restriction sites where 
additional DNA sequence inserts can be cloned. The most expeditious 
method of physically constructing such an artificial chromosome, i.e., 
ligatir.g the various essential elements together for example, will be 
apparent to those of ordinary skill in this art. 

Figure 10(B) illustrates an artificial chromosome that can be 
maintained as a circular molecule in yeast cells. Again, a stuffer 
fragment is included to separate the telomeric sequence blocks and 
farilitatp the reulication of the molecule as a circle. The artifi- 
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the DNA molecule in yeast, a yeast ARS, a yeast contromere, and yeast 
selectable marker are also included. 

Alternatively, a plant artificial chromosome is constructed that 
can be maintained as a linear molecule in yeast by utilizing the A^ 
thaliana tfclomeres as shown in Figure 10(C). 

The instant disclosure sets forth all essential information in 
connection with the invention. It has been described in some detai^ 
by way of illustration and example for purposes of clarity and ^-nd-jr- 
standing. It will be obvious that certain changes and ricdificaticn? 
may be practiced within the scope of the invention, as limited only by 
the scope of the appended claims. 
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yi^ftT TS CLAIMED IS: 

1. A recombinant DNA molecule which contains the telomere of a 
higher eukaryote. 

2. The recombinant DNA molecule of claim 1, wherein said higher 
eulcaryote is a plsnt. 

3. The recombinant DNA molecule of claim 2, wherein said plant 
<s ArabldoDSis thaliana . 

4. The recombinant DNA «^iolecule of claim 1, which additionally 
contains a centromere. 

5. The recombinant molecule of claim 1, which additionally 
contains an autonomous replicating saquence. 

5. The recombinant molecule of claim 1, which additionally 
contains a selectable marker gene. 

7. The recombinant DNA molecule of claim 1, which is capable of 
being maintained as a chromosome. 

8. The recombinant DNA molecule of claim 7, which is capable of 
being maintained as a chromosome in a prokaryote. 

9. The recombinant DNA molecule of claim 8, wherein said pro- 
karyote is a bacteria. 

10. The recombinant DNA molecule of claim 7, which is capable of 
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11. The recombinant DNA molecule of claim 10, wherein said 
eukaryote is a higher eukaryote. 

12. The recombinant DNA molecule of claim 11, wherein said 
higher eukaryote is a plant. 

13. The recombinant DNA molecule of claim U, wherein said 
higher eukaryote is a plant tissue culture cell. 

14. The recombir-ant DNA molecule of claim 7, which additionally 
contains a desired gane sequence. 

15. The r<»ccmbinait DNA molecule of claim 14, wherein said 
desired gena sequence is selected from the group consisting of: 

(1) a gene sequence of a hormone gene; 

{2) a gene sequence of an antibiotic resistance gene; 

(3) a gene sequence of a nitrogen fixation gene; 

(4) a gene sequence of a plant pathogen defense gene; 

(5) a gene sequence of a plant stress-induced gene; 

(6) a gene sequence of a toxin gene; and 

(7) a gene sequence of a seed storage gene. 

16. The recombinant DNA molecule of claim 15, wherein said 
recombinant molecule is capable of expressing said desired gene se- 
quence. 

17. The recombinant DNA molecule of claim 15, wherein said 
recombinant molecule is capable of expressing said dosired gene in a 
prokaryote. 



recombiiidnl moiecule capaDle evnrP.smu saic ue^ireu gene 



eukaryote . 
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19. The recombinant DNA ssolecule of claim 18, wherein said 
eukaryote is a higher eukaryote. 

20. The recombinant DNA molecule of claim 19, wherein said 
higher eukaryote is a plant. 

21. The recombinant DNA molecule of claim 19, wherein said 
higher eukaryote is a plant tissue culture cell. 

22. A method of producing a polypeptide in a recipient cell 

wnich comprises : 

(a) intrjdLiciny a gene sequence capable of producing said 
polypeptide into a recombinant molecule, said recombinant molecule 
containing a telomere of a higher eukaryote; 

(b) introducing said recombinant molecule containing said 
cloned gene sequence into a recipient cell; and 

(c) incubating said cell under conditions sufficient to 
permit the production of said polypeptide. 

23. Ths method of claim 22, wharein said recipient cell is a 
prokaryote. 

24. The method of claim 23, wherein said prokaryote is a bac- 
teria. 

25. The method of claim 22, wherein said recipient cell is a 
higher eukaryote. 
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27. The method of claim 25, wherein said higher eukaryote is a 
plant tissue culture cell. 

28. The recwBbinant DNA iBolecule of claim 1, wherein said telo- 
eere comprises tandemly repeated blocks of the base sequence 5'- 
[CCCTAAA]-3'. 

29. The recombinant DNA eolecule of claim 2S, wherein said 
telomere comprises variant repeats described by the base s-quence 5'- 
[CTCTAAA]-3'. ^ 

30. The racMbinant UNA molecule of claim ], wherein said telo- 
mere is the pAtT4 pUsmid, Accession Number ATCC 57577. 

31. A host cell transformed with the recombinant DNA molecule of 
claim 1. 

32. The host cell of claim 31, which is a eukaryotic cell. 

33. The host cell of cUim 32, which is a higher eukaryotic 

cell. 

34. the host cell of claim 33, which is a plant cell. 



35. A DNA molecule bearing the formula (CCCTAAA)r,, where 
n - 1 - 1000. 
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A/vGCTTG-oGCTGCAGfiicA CCCTAAA CCCTAAA CCCTi\AA 
CCCTAJU CCCTAAA CCCTAAA CCCTAAA CCCTAAA 
CCCT/vAA CCCTAAA CCCTAAA CCCTAAA CCCTAAA 
CCCTAAA CCCTAAA CCCTAAA CCCTAAA CCCTAAA> 
CCCTAAA CCCTAAA CCCTAAA CCCTAAA CCCTAAA 
CCCTAAA CCCTAAA CCCTAAA CCCTAAA CCCTAAA 
CCCTAAA CCCTAAA CCCTAAA CTCTAAA CTOTAAA 
CCCTAAA CCCTAAA CCCTAAA CCCTAAA CCCTAAA 
CCCTAAA CCCTAAA CCCTAAA CCCTAAA CCCTAAA 
CCCTAAA CCCTAAA CCCTAAA CCCTAAA CCCTAAA 
CCCTAAA CCCTAAA CCCTAAA CCCTAAA CCCTAAA 
CCCTAAA CCCTAAATAAAGCGCTGTGGiiAICcccGGGCGA 

Mbol 

FIG.4 
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Making small minichromosomes: Fragmenting fulliength chromosomes 
through resolution of integrated telomere inverted repeats 



Step One: Start with a full length chromosome 

telomere centromere telomere 

<3.^ O ■ 

L chromosome "arm" containing i 

' "genes and replication origins 



Step Two: Construct a telomere inverted repeat, flanked by two 
different drug resistance markers in T-DNA veciof 

telomere 
p Inverted p 
drug a" repeat drugs" 




StepThree: Intoduce the telomere inverted repeat construct into a 
plant chromosome 





chromosome with unresolved telomere 
palindrome 

Step Four: Screen for resolution of the telomere inverted repeat by 

Southern blotting experiments 

<] — — _o— — ==o 

and 



=!> 



omall acentric chromosome 
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FIG.8 

Functional selection for Plant ARSs: Shotgun cloning 

Step One: Start with a cloning vector of this general design 

Not! NotI 

telomere stuffer telomere 

— finmTn — m^Mm^ 
plant cloning Eyoji 

and. 

morker 

StepTw'O: Construct a plant genomic library in this vector: 
maintoifi in E . col? 

Step Three: Prepare plasmid DNA from this library; I inearize with NotI 




genomic plant 
DNA insert 
ARS??? 



Step Four: Direct DNA transformation into plant protoplasts; select 
transformants as caiius 



Step Five: Screen for instability of plant marker under nonselective 
qrowth. evidence for extrachromosomal maintenonrp 

Step Six; Prepare extrachromosomal DNA and recover plant ARS 
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Cloning Plant Centromeres using Telomerlc probes and Telotrisomic 
mutonts 

Step One: Prepare DNA from telotrisomic mutant and normal disomfc plants 



CEN-TELO 



CEN 



TCLO 



telotrisomic 



wild type disomic 



Step Two: Do a genomic Southern blot, probe with radiolabeled telomere; 
Look for RFLPs 

Telofrisomic Disomic 



RFLP- 
corrssponding to 
rne telomere 
centromere 

fusion 







Southern Blot probed 






with radiolabeled 


IT ?"f^ 




telomere 
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Step Three: Purify end-repaired, restricted genomic telotrisomic DNA 
corresponding to CEN-TEID RFLPj construct a library in o 
yeast ARS CEN vector 



Step Four; Screen the library using the telomere probe to find clones 
containing the CEN-TELO fusion fragment 
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